Convex Formulation of Multiple Instance Learning from Positive and Unlabeled Bags

نویسندگان

Han Bao

Tomoya Sakai

Issei Sato

Masashi Sugiyama

چکیده

Multiple instance learning (MIL) is a variation of traditional supervised learning problems where data (referred to as bags) are composed of sub-elements (referred to as instances) and only bag labels are available. MIL has a variety of applications such as content-based image retrieval, text categorization, and medical diagnosis. Most of the previous work for MIL assume that training bags are fully labeled. However, it is often difficult to obtain an enough number of labeled bags in practical situations, while many unlabeled bags are available. A learning framework called PU classification (positive and unlabeled classification) can address this problem. In this paper, we propose a convex PU classification method to solve an MIL problem. We experimentally show that the proposed method achieves better performance with significantly lower computation costs than an existing method for PU-MIL.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Ellipsoidal Multiple Instance Learning

We propose a large margin method for asymmetric learning with ellipsoids, called eMIL, suited to multiple instance learning (MIL). We derive the distance between ellipsoids and the hyperplane, generalising the standard support vector machine. Negative bags in MIL contain only negative instances, and we treat them akin to uncertain observations in the robust optimisation framework. However, our ...

متن کامل

Multiple-Instance Active Learning

We present a framework for active learning in the multiple-instance (MI) setting. In an MI learning problem, instances are naturally organized into bags and it is the bags, instead of individual instances, that are labeled for training. MI learners assume that every instance in a bag labeled negative is actually negative, whereas at least one instance in a bag labeled positive is actually posit...

متن کامل

CS395T Data Mining Project report One-class SVM formulations for Multiple Instance learning

Multiple Instance learning (MIL) considers a particular form of weak supervision in which the learner is given a set of positive bags and negative bags. Positive bags are sets of instances containing atleast one positive example and negative bags are sets of instances all of which are negative. A number of binary SVM based solutions have been proposed to this problem like the Normalized Set Ker...

متن کامل

Multiple Instance Learning for Computer Aided Diagnosis

Many computer aided diagnosis (CAD) problems can be best modelled as a multiple-instance learning (MIL) problem with unbalanced data: i.e. , the training data typically consists of a few positive bags, and a very large number of negative instances. Existing MIL algorithms are much too computationally expensive for these datasets. We describe CH, a framework for learning a Convex Hull representa...

متن کامل

Risk Minimization Framework for Multiple Instance Learning from Positive and Unlabeled Bags

Han Bao The University of Tokyo, 113-0033 Tokyo, Japan [email protected] Tomoya Sakai The University of Tokyo, 277-8561 Chiba, Japan RIKEN Center for AIP, 103-0027 Tokyo, Japan [email protected] Masashi Sugiyama RIKEN Center for AIP, 103-0027 Tokyo, Japan The University of Tokyo, 277-8561 Chiba, Japan [email protected] Issei Sato The University of Tokyo, 277-8561 Chiba, Japa...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2017

Convex Formulation of Multiple Instance Learning from Positive and Unlabeled Bags

نویسندگان

چکیده

منابع مشابه

Ellipsoidal Multiple Instance Learning

Multiple-Instance Active Learning

CS395T Data Mining Project report One-class SVM formulations for Multiple Instance learning

Multiple Instance Learning for Computer Aided Diagnosis

Risk Minimization Framework for Multiple Instance Learning from Positive and Unlabeled Bags

عنوان ژورنال:

اشتراک گذاری